Publish Consistency and Producer-Consumer Locking: Efficiently Distributing Updates to Web Servers Using a File System

Authors

  • Randal C. Burns
  • Robert M. Rees
  • Darrell D. E. Long

Abstract

For low-latency access and scalability, Web site replication with a distributed file system has advantages over proxy caching. In particular, file systems encapsulate data consistency and recovery from failure behind a simple interface, so that Web servers and Internet applications can operate as if they were on a single system. However, distributing content changes to many Web servers using a file system can heavily load the network between the servers, resulting in added latency. Poor performance arises in this environment because the Web-serving workload differs radically from the workload that file systems assume. To address this shortcoming, we introduce the publish cache consistency model, which is well suited to the Web-serving workload and relaxes cache-consistency constraints slightly in order to enhance performance. We implement publish consistency in the producer-consumer locking protocol, in which the file system pushes content updates to Web servers. A comparison of this protocol against other file system protocols by simulation shows that producer-consumer locking removes almost all latency due to protocol overhead and significantly reduces network load.

The work of this author was performed while a Visiting Scientist at the IBM Almaden Research Center.
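The push-based idea in the abstract can be illustrated with a small sketch. It is not the authors' protocol or implementation: the class names (`FileServer`, `Consumer`), the `write`/`publish` split, and the push counter are all hypothetical, and the sketch assumes that readers keep serving the last published version until the server pushes a newer one.

```python
from dataclasses import dataclass, field


@dataclass
class Consumer:
    """Hypothetical Web server that serves files from a local cache."""
    name: str
    cache: dict = field(default_factory=dict)  # path -> (version, content)

    def receive_push(self, path, version, content):
        # Swap in the whole new version; readers never see a partial write.
        self.cache[path] = (version, content)

    def serve(self, path):
        # Always answered locally, with no message to the file server.
        return self.cache[path][1]


class FileServer:
    """Hypothetical file-system server that pushes published versions."""

    def __init__(self):
        self.drafts = {}      # path -> content being written by the producer
        self.published = {}   # path -> (version, content)
        self.consumers = {}   # path -> list of registered Consumer objects
        self.pushes = 0       # number of push messages sent

    def register_consumer(self, path, consumer):
        self.consumers.setdefault(path, []).append(consumer)
        if path in self.published:
            self._push(path, consumer)

    def write(self, path, content):
        # Assumption: writes stay invisible to consumers until publish().
        self.drafts[path] = content

    def publish(self, path):
        version = self.published.get(path, (0, None))[0] + 1
        self.published[path] = (version, self.drafts.pop(path))
        for consumer in self.consumers.get(path, []):
            self._push(path, consumer)

    def _push(self, path, consumer):
        version, content = self.published[path]
        consumer.receive_push(path, version, content)
        self.pushes += 1


server = FileServer()
farm = [Consumer(f"web{i}") for i in range(3)]

server.write("/index.html", "v1")
server.publish("/index.html")
for web_server in farm:
    server.register_consumer("/index.html", web_server)

server.write("/index.html", "v2")
during_write = [w.serve("/index.html") for w in farm]   # still the old version

server.publish("/index.html")
after_publish = [w.serve("/index.html") for w in farm]  # new version everywhere
```

In this sketch each published change costs one message per Web server, and reads never contact the file server, which is the kind of saving the abstract attributes to pushing updates instead of having every server revalidate its cache.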


Similar resources

Efficiently Distributing Data in a Web Server Farm

We present a cache consistency model and locking protocol customized for file systems that are used as a scalable infrastructure for Web server farms. The protocol takes advantage of the relaxed consistency semantics of the Web to reduce latencies and network overhead. Ours is a hybrid approach that preserves strong consistency for concurrent write sharing with time-based consistency and push c...

Efficient Data Distribution in a Web Server Farm

High-performance Web sites rely on Web server "farms" (hundreds of computers serving the same content) for scalability, reliability, and low-latency access to Internet content. Deploying these scalable farms typically requires the power of distributed or clustered file systems. Building Web server farms on file systems complements hierarchical proxy caching. Proxy caching replicates Web content ...

Distributing the Data Plane for Remote Storage Access

Sub-microsecond network and memory latencies require fast user-level access to local and remote storage. While user-level access to local storage has been demonstrated recently, it does not currently extend to serverless parallel systems in datacenter environments. We propose direct user-level access to remote storage in a distributed setting, unifying fast data access and high-performance remo...

Corona: A High Performance Publish-Subscribe System for the World Wide Web

Despite the abundance of frequently changing information, the Web lacks a publish-subscribe interface for delivering updates to clients. The use of naïve polling for detecting updates leads to poor performance and limited scalability as clients do not detect updates quickly and servers face high loads imposed by active polling. This paper describes a novel publish-subscribe system for the Web c...

Publication date: 2006